Scaling up knowledge graph creation to large and heterogeneous data sources

نویسندگان

چکیده

RDF knowledge graphs (KG) are powerful data structures to represent factual statements created from heterogeneous sources. KG creation is laborious and demands management techniques be executed efficiently. This paper tackles the problem of automatic generation processes declaratively specified; it proposes for planning transforming into triples following mapping assertions specified in Mapping Language (RML). Given a set assertions, planner provides an optimized execution plan by partitioning scheduling assertions. First, assesses number partitions considering sources, type associations between different After providing list that belong each partition, determines their order. A greedy algorithm implemented generate partitions’ bushy tree plan. Bushy plans translated operating system commands guide order indicated tree. The proposed optimization approach evaluated over state-of-the-art RML-compliant engines, existing benchmarks sources RML maps. Our experimental results suggest performance studied engines can considerably improved, particularly complex setting with numerous maps large As result, time out cases enabled produce at least portion applying planner.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Scaling Access to Heterogeneous Data Sources with DISCO

Accessing many data sources aggravates prob lems for users of heterogeneous distributed databases Database administrators must deal with fragile mediators that is mediators with schemas and views that must be sig ni cantly changed to incorporate a new data source When implementing translators of queries from mediators to data sources database implementors must deal with data sources that do not...

متن کامل

Semantic Knowledge Discovery from Heterogeneous Data Sources

Available domain ontologies are increasing over the time. However there is a huge amount of data stored and managed with RDBMS. We propose a method for learning association rules from both sources of knowledge in an integrated way. The extracted patterns can be used for performing: data analysis, knowledge completion, ontology refinement.

متن کامل

Managing Data from Heterogeneous Data Sources Using Knowledge Layer

In the process of data integration using ontologies it is important to manage data from external data sources in the same way as data stored in the Knowledge Base. In previous papers [1], [2] the way of inference from data stored in the Knowledge Base, using Knowledge Cartography idea has been presented. However, this solution requires loading all data to the Knowledge Base. The solution presen...

متن کامل

Scaling Up Radial Graph Layouts

The well-known radial graph layout technique has plenty of advantages for graph visualization and graph exploration, but is quite limited in the size of graphs it can display effectively because the layout is inherently global and allows far away or out of view nodes to distort the focus region. The primary contribution of this project is to introduce sub tree clustering to allow the layout alg...

متن کامل

Text Categorization with Knowledge Transfer from Heterogeneous Data Sources

Multi-category classification of short dialogues is a common task performed by humans. When assigning a question to an expert, a customer service operator tries to classify the customer query into one of N different classes for which experts are available. Similarly, questions on the web (for example questions at Yahoo Answers) can be automatically forwarded to a restricted group of people with...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Web Semantics

سال: 2023

ISSN: ['1570-8268', '1873-7749']

DOI: https://doi.org/10.1016/j.websem.2022.100755